Automatic Text Browsing Using Vector Space Model

Authors

  • Amit Singhal
  • Gerard Salton
Abstract

Vast amounts of text are now available in machine-readable form and can be processed electronically. The vector space model of text processing has been widely used and has consistently produced superior retrieval results for the last thirty years. Traditionally, information retrieval research has concentrated on improving on-demand retrieval of useful textual information. Often a user has no particular information need; instead, he is just interested in browsing the available information. Most hypertext systems depend on manually created browsing structures to support text browsing. Manual text linking does not scale up to the amount of electronic information accessible in today's networked computing environment. The vector space model can be used to automatically impose a flexible structure on arbitrary texts. Tools can then be provided for effective browsing of this structured text.
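
As a rough illustration of how a vector space model might impose link structure on arbitrary texts, the sketch below represents each text as a tf-idf weighted term vector and proposes a browsing link between any two texts whose cosine similarity reaches a threshold. The function names, the whitespace tokenizer, the particular tf-idf weighting, and the threshold value are all assumptions made for this example; the abstract does not specify the authors' actual weighting scheme or linking criteria.

```python
import math
from collections import Counter


def tfidf_vectors(texts):
    """Build a sparse tf-idf vector (dict: term -> weight) for each text.

    Assumption: plain lowercase whitespace tokenization and
    weight = term frequency * log(N / document frequency).
    """
    tokenized = [text.lower().split() for text in texts]
    n_docs = len(tokenized)
    # Document frequency: number of texts that contain each term.
    doc_freq = Counter(term for tokens in tokenized for term in set(tokens))
    vectors = []
    for tokens in tokenized:
        counts = Counter(tokens)
        vectors.append({
            term: tf * math.log(n_docs / doc_freq[term])
            for term, tf in counts.items()
        })
    return vectors


def cosine(u, v):
    """Cosine similarity between two sparse vectors stored as dicts."""
    dot = sum(w * v[t] for t, w in u.items() if t in v)
    norm_u = math.sqrt(sum(w * w for w in u.values()))
    norm_v = math.sqrt(sum(w * w for w in v.values()))
    if norm_u == 0 or norm_v == 0:
        return 0.0
    return dot / (norm_u * norm_v)


def suggest_links(texts, threshold=0.2):
    """Return (i, j, similarity) for text pairs at or above the threshold.

    The threshold is a hypothetical tuning knob, not a value from the paper.
    Results are sorted from most to least similar.
    """
    vectors = tfidf_vectors(texts)
    links = []
    for i in range(len(texts)):
        for j in range(i + 1, len(texts)):
            sim = cosine(vectors[i], vectors[j])
            if sim >= threshold:
                links.append((i, j, sim))
    return sorted(links, key=lambda link: -link[2])


if __name__ == "__main__":
    sample = [
        "vector space retrieval model",
        "vector space browsing model",
        "cooking pasta recipes",
    ]
    for i, j, sim in suggest_links(sample):
        print(f"link text {i} <-> text {j} (similarity {sim:.2f})")
```

In this toy run the first two texts share three terms and are linked, while the third shares none and stays unlinked. Raising or lowering the threshold trades link density against link quality, which is one way such a structure can remain flexible.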

Similar resources

Emotion Detection in Persian Text; A Machine Learning Model

This study aimed to develop a computational model for recognizing emotion in Persian text as a supervised machine learning problem. We considered Plutchik's emotion model as the supervised learning criterion and a Support Vector Machine (SVM) as the baseline classifier. We also used the NRC lexicon and contextual features as training data and components of the model. One hundred selected texts including pol...

An Improvement in Support Vector Machines Algorithm with Imperialism Competitive Algorithm for Text Documents Classification

Due to the exponential growth of electronic texts, their organization and management require a tool that provides users with the information and data they search for in the shortest possible time. Thus, classification methods have become very important in recent years. In natural language processing, and especially text processing, one of the most basic tasks is automatic text classification. Moreover, text ...

Improvement of generative adversarial networks for automatic text-to-image generation

This research concerns the use of deep learning tools and image processing technology for the automatic generation of images from text. Previous research has used one sentence to produce images. In this research, a memory-based hierarchical model is presented that uses three different descriptions, given in the form of sentences, to produce and improve the image. The proposed ...

Exploring Multidimensional Continuous Feature Space to Extract Relevant Words

With growing amounts of text data, descriptive metadata become more crucial for processing it efficiently. One kind of such metadata is keywords, which we can encounter, e.g., in everyday browsing of webpages. Such metadata can be of benefit in various scenarios, such as web search or content-based recommendation. We research the keyword extraction problem from the perspective of vector space and ...

Feeler: Emotion Classification of Text Using Vector Space Model

Over the last quarter-century, there has been an increasing body of research on understanding human emotions. In this study, automatic classification of anger, disgust, fear, joy and sad emotions in text has been studied on the ISEAR (International Survey on Emotion Antecedents and Reactions) dataset. For the classification we have used a Vector Space Model with a total of 801 news headlines provided...

Publication date: 1995